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181 GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGC 
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCG 



NFkB_CSl 
GGGRQTYYQC 
NFkB-MHC-I . 2 
TGGGCTTCCCC 

241 ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 
TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 

Intronl 

*********************************************************** 

3 01 GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTC 
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB_CSl 
GGGRQTYYQC 
NFkB_CS2 
RGGGRMTYYCC 

Topo_II_cleavage_site 
RNYNNCNNGYNGKTNYNY 
*★******★*★★*★■*■**■> 

3 61 AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG 
TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 

FIG. 12 
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1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 

51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 

101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT 7VATCATGGCA TTCACTCAGC 

151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 

2 01 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 

3 01 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 

3 51 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 

4 01 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 

5 51 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 

10 01 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 

1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 

1151 TACACAACTG ATAATAAATG • CGTCACACAA TTTATTAATG AATTTTTCTA 

12 01 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 

13 01 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 

13 51 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 

14 01 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 

14 51 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 

15 01 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 
1651 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 
1701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
18 01 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2 0 01 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC^ 
2 0 51 _TTTAGAAAGA .AAGAAATGAA^ 

'2T01 ^TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 

2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 

22 01 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 

2251 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 

2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 

2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 



FIG, 13 
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24 01 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 

24 51 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 

2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 

2 551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 

2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 

2 651 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 

2 701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 

2 751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 

2 801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 

2 851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 

2 901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 

2 951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 

3 001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
3 051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
32 01 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 

FIG. 13 
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MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDP 
AAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRL 
CERGAKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALR 
GSGAWGLLLRRVGDDVLVHLLARCALFVLVAPSCAYQVCGPPLY 
QLGAATQARP P PHASGPRRRLGCERAWNHS VREAGVPLGL PAPG 
ARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRG 
PSDRGFCWSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPP 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRP 
SLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLEL 
LGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEE 
EDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNE 
RRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGC 
VPAAEHRLREEILAKFLHWLMSVYWELLRSFFYVTETTFQKNR 
LFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPAL 
LTSRLRFIPKPDGLRPIVNMDYWGARTFRREKRAERLTSRVKA 
LFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPP 
ELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAWQ 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAWI 
EQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSI 
LSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHA 
KTFLRTLVRGVPEYGCWNLRKTWNFPVEDEALGGTAFVQMPA 
HGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGR 
NMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRF 
HACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSL 
GAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQ 
TQLSRKL PGTTLTALEAAANPAL PSDFKT I LD 

FIG. 17 

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 
TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 
GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCT 
GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 
CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 
AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 
GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT 
GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGA 
CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCC 
CCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGT 
GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 
CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 
GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 
TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA 
CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 
AGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGA 
TGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 
TGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACC 
GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 
GTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 
ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTC 
ATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCC 
TCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATCTCG 
-GCGGGG7TCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC 
TGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGC 
AGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGG 
CACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCG 
AGAGCAGACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGC 
CCACACCCAGGCCTGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 
GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 
TGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCC 
AGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC. 
CCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC 
CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 
GCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGG 
GGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGOAAAAAAAAAA 
AAAAAAAAAAAAAAAA 
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MetSerValTyrValValGluLeuLeuArgSerPhePhe 
TyrValThrGluThrThrPheGlnLysAsnArgLeuPhe 
PheTyrArgLysSerValTrpSerLysLeuGlnSerlle 
GlylleArgGlnHisLeuLysArgValGlnLeuArgGlu 
LeuSerGluAlaGluValArgGlnHisArgGluAlaArg 
ProAlaLeuLeuThrSerArgLeuArgPhelleProLys 
ProAspGlyLeuArgProIleValAsnMetAspTyrVal 
ValGlyAlaArgThrPheArgArgGluLysArgAlaGlu 
ArgLeuThrSerArgValLysAlaLeuPheSerValLeu 
AsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAla 
SerValLeuGlyLeuAspAspIleHisArgAlaTrpArg 
ThrPheValLeuArgValArgAlaGlnAspProProPro 
GluLeuTyrPheValLysValAspValThrGlyAlaTyr 
AspThrlleProGlnAspArgLeuThrGluVallleAla 
Serllell eLy s ProGl nAsnThr Ty r Cy sVal ArgArg 
TyrAlaValValGlnLysAlaAlaHisGlyHisValArg 
LysAlaPheLysSerHisValLeuArgProValProGly 
AspProAlaGlyLeuHisProLeuHisAlaAlaLeuGln 
ProValLeuArgArgHisGlyGluGlnAlaValCysGly 
AspSerAlaGlyArgAlaAlaProAlaPheGlyGly 

F/G. 19 

1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 



80 90 

leu lys glu leu val ala arg val leu gin arg leu cys glu arg 

CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 

GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 

ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 

GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



F/G. 20 
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130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAG CTG CCC AAC ACG GTG ACC GAG GGA CTG CGG GGG AGG GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG GTG GTG GTG GGG GGG GTG GGG GAG GAG GTG GTG GTT GAG 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
GTG GTG GGA GGG TGG GGG CTG TTT GTG GTG GTG GGT GGG AGG TGG 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GGG TAG GAG GTG TGG GGG CGG GGG GTG TAG GAG GTG GGG GGT GGG 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT GAG GGG GGG GGG CGG CCA GAG GGT AGT GGA GGG GGA AGG GGT 

200 210 
leu gly cys glu arg ala trp asn his ser val arg glu ala gly 
GTG GGA TGG GAA GGG GGG TGG AAC GAT AGG GTG AGG GAG GGG GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTG GGG GTG GGG CTG GGA GGG GGG GGT GGG AGG AGG GGG GGG GGG 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GGG AGG GGA AGT GTG GGG TTG GGG AAG AGG GGG AGG GGT GGG 

250 

ala ala pro glu pro glu arg thr pro val gly gin gly ser trp 
GGT GCG GGT GAG GGG GAG GGG AGG CCC GTT GGG GAG GGG TGG TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GGG GAG GGG GGG AGG AGG CGT GGA CGG AGT GAG GGT GGT TTG TGT 



280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 

GTG GTG TGA GGT GGG AGA CCC GCG GAA GAA GGG AGG TGT TTG GAG 

290 300 
gly ala leu _ser_gly _thr^_arg_his_ser hj^ 

GGT^GCG~CTC""TCT GGG ACG CGG GAG TGG GAG GGA TGG GTG GGG CGG 

310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 

GAG GAG GAG GGG GGC CCC CCA TGG AGA TGG GGG GGA GGA GGT GCG 

320 330 

trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 

TGG GAG AGG GGT TGT CGG CGG GTG TAG GGG GAG AGG AAG GAG TTG 
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340 

leu tyr ser ser gly asp lys glu gin leu arg pro ser phe leu 

CTC TAC TCC TCA GGC GAC AAG GAG GAG CTG CGG CCC TCC TTC CTA 

350 360 

leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 

CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 

GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 

arg arg leu pro arg leu pro gin arg tyr trp gin met arg pro 

CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr -gly 

CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 

val leu leu lys thr his cys pro leu arg ala ala val thr pro 

GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 

430 

ala ala gly val cys ala arg glu lys pro gin gly ser val ala 

GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 

440 450 

ala pro glu glu glu asp thr asp pro arg arg leu val gin leu 

GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 

leu arg gin his ser ser pro trp gin val tyr gly phe val arg 

CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 480 

ala cys leu arg arg leu val pro pro gly leu trp gly ser arg 

GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 

CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 

-leu-gly— lys— his— ala -l-ys— leu--ser— l-eu~g-ln -giu -leu- t^^^ 

CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 

ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 

gly cys val pro ala ala glu his arg leu arg glu glu ile leu 

GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 
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550 

ala lys phe leu his trp leu met ser val tyr val val glu leu 

GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAG GTC GTC GAG CTG 

560 570 

leu arg ser phe phe tyr val thr glu thr thr phe gin lys asn 

CTG AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 
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600 




ile 


gly 


ile 


arg 


gin 


his 
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lys 
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val 


gin 
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glu 
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ATT 


GGA 


ATC 
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610 

ser glu ala glu val arg gin his arg glu ala arg pro ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 

640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 

650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
-tyr— phe- -va 1— lys— va 1— asp— va 1— t hr-g 1-y- a 1 a tyr -a sp— t hr^i-le— p r o- 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 



FIG. 20 

(CONTINUED) 



27/103 



760 

his gly his val arg lys ala phe lys ser his val leu arg pro 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 

770 780 
val pro gly asp pro ala gly leu his pro leu his ala ala leu 
GTG CCA GGG GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG 

790 

gin pro val leu arg arg his gly glu gin ala val cys gly asp 
CAG CCT GTG CTA CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

800 807 
ser ala gly arg ala ala pro ala phe gly gly OP 

TCG GCG GGA CGG GCT GCT CCT GCG TTT GGT GGA TGA TTTCTTGTTGGT 
GACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGA 
GTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGC 
CCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCT 
GCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTC 
CATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAA 
ACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAG 
CCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCA 
CGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCT 
GCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGG 
GATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCT 
GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCT 
GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC 
TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 
ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGGCT 
CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 
CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 
GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 
-CACAGGCTGGCGCTCGGCT-CCAGGCGAGGGCGAGCTT-TT-GCTCAGGAGGAGeGGGGCTTG 
CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 
CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 
TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT 
GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 
AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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1 CCATGGGACCCACTGCAGGGGCAGCTGGGAGGCTGCAGGCTTCAGGTCCCAGTGGGGTTG 
GGTACCCTGGGTGACGTCCCCGTCGACCCTCCGACGTCCGAAGTCCAGGGTCACCCCAAC 

6 1 CCATCTGCCAGTAGAAACCTGATGTAGAATCAGGGCGCGAGTGTGGACACTGTCCTGAAT 
GGTAGACGGTCATCTTTGGACTACATCTTAGTCCCGCGCTCACACCTGTGACAGGACTTA 

121 CTCAATGTCTCAGTGTGTGCTGAAACATGTAGAAATTAAAGTCCATCCCTCCTACTCTAC 
GAGTTACAGAGTCACACACGACTTTGTACATCTTTAATTTCAGGTAGGGAGGATGAGATG 

181 TGGGATTGAGCCCCTTCCCTATCCCCCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTG 
ACCCTAACTCGGGGAAGGGATAGGGGGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACAC 

241 GAGGAAGGAATGATACTTTGTTATTTTTCACTGCTGGTACTGAATCCACTGTTTCATTTG 
CTCCTTCCTTACTATGAAACAATAAAAAGTGACGACCATGACTTAGGTGACAAAGTAAAC 



3 01 TTGGTTTGTTTGTTTTGTTTTGAGAGGCGGTTTCACTCTTGTTGCTCAGGCTGGAGGGAG 
AACCAAACAAACAAAACAAAACTCTCCGCCAAAGTGAGAACAACGAGTCCGACCTCCCTC 



361 TGCAATGGCGCGATCTTGGCTTACTGCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCT 
ACGTTACCGCGCTAGAACCGAATGACGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGA 

alu 

4 21 GCTTCCGCCTCCCATTTGGCTGGGATTACAGGCACCCGCCACCATGCCCAGCTAATTTTT 
CGAAGGCGGAGGGTAAACCGACCCTAATGTCCGTGGGCGGTGGTACGGGTCGATTAAAAA 



4 81 TGTATTTTTAGTAGAGACGGGGGTGGGGGTGGGGTTCACCATGTTGGCCAGGCTGGTCTC 
ACATAAAAATCATCTCTGCCCCCACCCCCACCCCAAGTGGTACAACCGGTCCGACCAGAG 

CAP 



541 GAACTTCTGACCTCAGATGATCCACCTGCCTCTGCCTCCTAAAGTGCTGGGATTACAGGT 
CTTGAAGACTGGAGTCTACTAGGTGGACGGAGACGGAGGATTTCACGACCCTAATGTCCA 



£.0.1_-GTGAGCCA.CCATGGGGAGGTGAGAAT-T-mGTeTGTTTAGAAACATCTGGGTCTGAGG ^ 
CACTCGGTGGTACGGGTCGAGTCTTAAATGAGACAAATCTTTGTAGACCCAGACTCCATC 

CCAAT 
★ ★★★★****•*•*★★★★> 

661 GAAGCTCACCCCACTCAAGTGTTGTGGTGTTTTAAGCCAATGATAGAATTTTTTTATTGT 
CTTCGAGTGGGGTGAGTTCACAACACCACAAAATTCGGTTACTATCTTAAAAAAATAACA 

721 TGTTAGAACACTCTTGATGTTTTACACTGTGATGACTAAGACATCATCAGCTTTTCAAAG 
ACAATCTTGTGAGAACTACAAAATGTGACACTACTGATTCTGTAGTAGTCGAAAAGTTTC 
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CAP 



> 



781 ACACACTAACTGCACCCATAATACTGGGGTGTCTTCTGGGTATCAGCGATCTTCATTGAA 
TGTGTGATTGACGTGGGTATTATGACCCCACAGAAGACCCATAGTCGCTAGAAGTAACTT 

CAP 

84 1 TGCCGGGAGGCGTTTCCTCGCCATGCACATGGTGTTAATTACTCCAGCATAATCTTCTGC 
ACGGCCCTCCGCAAAGGAGCGGTACGTGTACCACAATTAATGAGGTCGTATTAGAAGACG 



* * ★ 

901 TTCCATTTCTTCTCTTCCCTCTTTTAAAATTGTGTTTTCTATGTTGGCTTCTCTGCAGAG 
AAGGTAAAGAAGAGAAGGGAGAAAATTTTAACACAAAAGATACAACCGAAGAGACGTCTC 

CAP 

961 AACCAGTGTAAGCTACAACTTAACTTTTGTTGGAACAAATTTTCCAAACCGCCCCTTTGC 
TTGGTCACATTCGATGTTGAATTGAAAACAACCTTGTTTAAAAGGTTTGGCGGGGAAACG 

1021 CCTAGTGGCAGAGACAATTCACAAACACAGCCCTTTAAAAAGGCTTAGGGATCACTAAGG 
GGATCACCGTCTCTGTTAAGTGTTTGTGTCGGGAAATTTTTCCGAATCCCTAGTGATTCC 

1081 GGATTTCTAGAAGAGCGACCCGTAATCCTTAAGTATTTACAAGACGAGGCTAACCTCCAG 
CCTAAAGATCTTCTCGCTGGGCATTAGGAATTCATAAATGTTCTGCTCCGATTGGAGGTC 

1141 CGAGCGTGACAGCCCAGGGAGGGTGCGAGGCCTGTTCAAATGCTAAGCTTCCATAAATAA 
GCTCGCACTGTCGGGTCCCTCCCACGCTCCGGACAAGTTTACGATTCGAAGGTATTTATT 

12 01 AGCAAATTTCCTCCGGCAGTTTCTGGAT^GTAGGAAAGGTTAACATTTAAGGTTGCGTTT 
TCGTTTAAAGGAGGCCGTCAAAGACCTTTCATCCTTTCCAATTGTAAATTCCAACGCAAA 

1261 GTTAGCATTTCAGTGTTTGCCGACCTCAGCTAACAGCATCCCTGCAAGGCCTCGGGAGAC 
CAATCGTTW^GTCACAAACGGCTGGAGTCGATTGTCGTAGGGACGTTCCGGAGCCCTCTG 

1321 CCAGAAGTTTCTCGCCCCTTAGATCCAAACTTGAGCAACCCGGAGTCTGGATTCCTGGGA 
GGTCTTCAAAGAGCGGGGAATCTAGGTTTGAACTCGTTGGGCCTCAGACCTAAGGACCCT 

TopoII 

1381 AGTCCTCAGCTGTCCTGCGGTTGTGCCGGGGCCCCAGGTCTGGAGGGGACCAGTGGCCGT 
TCAGGAGTCGACAGGACGCCAACACGGCCCCGGGGTCCAGACCTCCCCTGGTCACCGGCA 

1441 GTGGCTTCTACTGCTGGGCTGGAAGTCGGGCCTCCTAGCTCTGCAGTCCGAGGCTTGGAG 
CACCGAAGATGACGACCCGACCTTCAGCCCGGAGGATCGAGACGTCAGGCTCCGAACCTC 



1501 CCAGGTGCCTGGACCCCGAGGCTGCCCTCCACCCTGTGCGGGCGGGATGTGACCAGATGT 
GGTCCACGGACCTGGGGCTCCGACGGGAGGTGGGACACGCCCGCCCTACACTGGTCTACA 

1561 TGGCCTCATCTGCCAGACAGAGTGCCGGGGCCCAGGGTCAAGGCCGTTGTGGCTGGTGTG 
ACCGGAGTAGACGGTCTGTCTCACGGCCCCGGGTCCCAGTTCCGGCAACACCGACCACAC 

1621 AGGCGCCCGGTGCGCGGCCAGCAGGAGCGCCTGGCTCCATTTCCCACCCTTTCTCGACGG 
TCCGCGGGCCACGCGCCGGTCGTCCTCGCGGACCGAGGTAAAGGGTGGGAAAGAGCTGCC 
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1681 GACCGCCCCGGTGGGTGATTAACAGATATTGGGGTGGTTTGCTCATGGTGGGGACCCCTT 
CTGGCGGGGCCACCCACTAATTGTCTATAACCCCACCAAACGAGTACCACCCCTGGGGAA 

1741 CGCCGCCTGAGAACCTGCAAAGAGAAATGACGGGCCTGTGTCAAGGAGCCCAAGTCGCGG 
GCGGCGGACTCTTGGACGTTTCTCTTTACTGCCCGGACACAGTTCCTCGGGTTCAGCGCC 

1801 GGAAGTGTTGCAGGGAGGCACTCCGGGAGGTCCCGCGTGCCCGTCCAGGGAGCAATGCGT 
CCTTCACAACGTCCCTCCGTGAGGCCCTCCAGGGCGCACGGGCAGGTCCCTCGTTACGCA 

1861 CCTCGGGTTCGTCCCCAGCCGCGTCTACGCGCCTCCGTCCTCCCCTTCACGTCCGGCATT 
GGAGCCCAAGCAGGGGTCGGCGCAGATGCGCGGAGGCAGGAGGGGAAGTGCAGGCCGTAA 

1921 CGTGGTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGA 
GCACCACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCT 

1981 TCA.GGCCAGCGGCCAAAGGGTCGCCGC7VCGCACCTGTTCCCAGGGCCTCCACATCATC^ 
AGTCCGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCG 

2 041 CCCTCCCTCGGGTTACCCCACAGCCTAGGCCGATTCGACCTCTCTCCGCTGGGGCCCTCG 
GGGAGGGAGCCCAATGGGGTGTCGGATCCGGCTAAGCTGGAGAGAGGCGACCCCGGGAGC 

Spl 

2101 CTGGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCAGAC 
GACCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTCTG 

2161 CCCCGGGTCCGCCCGGAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCG 
GGGGCCCAGGCGGGCCTCGTCGACGCGACAGCCCCGGTCCGGCCCGAGGGTCACCTAAGC 

2221 CGGGCAACAGACGCCCAGGACCGCGCTTCCCACGTGGCGGAGGGACTGGGGACCCGGGCA 
GCCCGTTGTCTGCGGGTCCTGGCGCGAAGGGTGCACCGCCTCCCTGACCCCTGGGCCCGT 

Spl 



E2F 

22 81 CCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCCCGTCCC 
GGCCAGGACGGGGAAGTGGAAGGTCGAGGCGGAGCAGGCGCGCCTTGGGGCGGGGCAGGG 

2341 GAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGGCCATCCCAGCCCGTCCCGTTCCT 
CTTGGGAAGGGCCCAGGGGCCGGGTCGGGGAAGGCCCGGTAGGGTCGGGCAGGGCAAGGA 

Spl 



E2F NFkB 

★★*★**★★★ 

_24.0.1_-T-T-TCCGCGGGGGGGGGGTG-TGG-TGGGGGGGCGAGTTTeAGGeAGCGeTGCGTCCTGCTGC 
AAAGGCGCCGGGGCGGGAGAGGAGCGCCGCGCTCAAAGTGCGTCGCGACGCAGGACGACG 

hTRT5 ' 

2461 GCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAG 
CGTGCACCCTTCGGGACCGGGGCCGGTGGGGGGGCTACGGCGCGCGAGGGGCGACGGCTC 

2521 CCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGC 
GGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGGACG 
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E2F 

2 581 GGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCG 
CCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGC 



★ 

2641 CGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCC 
GCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGG 

NFkB 



2 701 CCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCG 
GGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCAACTCCCGCCGGC 

Topo_I I__c 1 eavag 



NFkB 
++++++++++ 
NFkB 



Intronl 

2761 GGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGT 
CCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCA 

e site 



2821 GTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAA 
CAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTT 

2 881 GAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGC 
CTTGCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCG 



2941 CTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAG 
GAAGTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTC 

3 001 CGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGC 
GCCCCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCG 

3 061 ACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCC 
TGCGACGCGCGAGAAACACGACCACCGAGGGTCGACGCGGATGGTCCACACGCCCGGCGG 

i3±21-GCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCa3GCCA^^ 

CGACATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGC 

3181 AAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCT 
TTCCGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGA 

3241 GGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTT 
CCCGGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCACGGTCGGCTTCAGACGGCAA 

33 01 GCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGG 
CGGGTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCC 
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3361 GTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTC 
CAGGACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAG 

3421 ACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCA 
TGGACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGT 

3481 CTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACC 
GAGGGTGGGTAGGCACCCGGCGGTCGTGGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGG 

3 541 ACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTC 
TGCAGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAG 

3 601 CTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCT 
GAGTCCGCTGTTCCTCGTCGACGCCGGGAGGAAGGATGAGTCGAGAGACTCCGGGTCGGA 

3 661 GACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGG 
CTGACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTCCGGGACCTACGGTCC 

3 721 GACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCT 
CTGAGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGA 

3 781 GGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCC 
CCTCGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGG 

3 841 GCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTC 
CGACGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAG 

3 901 TGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCA 
ACACCGCCGGGGGCTCCTCCTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGT 

3 961 GCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCC 

CGTGTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGG 

4 021 CCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTT 

GGGTCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAA 



4 081 CATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGT 
GTAGAGGGACCCCTTCGTACGGTTCGAGAGCGACGTCCTCGACTGCACCTTCTACTCGCA 



4141 GCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCC 
CGCCCTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGG 

Intron2 

^4201 "AGGCCCC7VGAGCTGAAT 

TCCGGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGA 



4261 CCTGTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACG 
GGACAGAGGTAGCAGTGCACCCGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGC 



4321 GTGATCGAGGTCGAC 
CACTAGCTCCAGCTG 
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Introns 1 2 3 
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gccaagt tcctgcactggctgatgagtgtgtacgtcgtcgagctgctcaggtctttcttt 
tatgtcacggagaccacgtttcaaaagaacaggctctttttctaccggaagagtgtctgg 
agcaagttgcaaagcattggaatcagacagcacttgaagagggtgcagctgcgggacgtg 
tcggaagcagaggtcaggcagcatcgggaagccaggcccgccctgctgacgtccagactc 
cgcttcatccccaagcctgacgggctgcggccgattgtgaacatggactacgtcgtggga 
gccagaacgttccgcagagaaaagagggccgagcgtctcacctcgagggtgaaggcactg 
ttcagcgtgctcaactacgagcgggcgcg 

FIG. 23 



TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAG 
ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCC 
AGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 
AGGGGCAAGTC 

FIG, 24 



PANEL A 




elution with 
displacement 
oligonucleotide 
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NXT Heparin Affinity Glyc. grad. 




100% 62% 132% 50% 
relative activity/pmol telomerase 
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Glyc. grad. 

fractions pmol RSA 

1 I 



0.5 1.5 4.5 15 
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oooooooooo 

0% 0.4% 12% 17% 13% 0% 0.7% 9% 78% 94% 

% primer extended 
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3' TELOMERASE 5' 



TEMPLATE 

CCAAAACCCCAAAAC 




3' TELOMERASE 5" 



TEMPLATE 

CCAAAACCCCAAAAC • 
,qGGTTTT-3 




PAMELA 



PANEL B 
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1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 

'r3-or 

1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 



CCCCAAAACC 
GTAGTTTAGA 
TTTGGATGAT 
GCTCTTGTAG 
TCGCCTTCAT 
CTCGGATGCA 
AAAGCGGAGA 
AATCAGGTAA 
TTATGGAGAA 
CCTAGCCAAC 
GATCTCGATA 
AAAAAGTTTG 
TATGGGTTTT 
TTGAGACAAT 
AGTTCTGATG 
CTCAATTTAA 
TAATGGAATA 
TTGGATTCTT 
TTACAACAGA 
AAAAGAAGCA 
TTGTTGATTC 
AAAGAAGAAG 
AGGTAAGAGA 
CACAGCTGTT 
GTAAAAAGTA 
TCACATTCAT 
^GCAGTCATCC 
AAGAAATGGA 
AAGAATCGAA 
ATTGAAGAGA 
GTAACTTTTA 
CGATCTTCAA 
ATACAAACCT 
AAAGAAAAAA 
AAGATTTATT 
GGGGTTTTGG 



CCAAAACCCC 

AATAAAATAT 

ATAGAAAATT 

TGACAAGAAA 

TGACTATTCC 

AATCTTTATA 

GCAAAGAGTA 

TGAGGATTAT 

AATTACTTAA 

AATGATGAGT 

CATCAGACTT 

ATAATCGAAC 

ATTACAATTG 

TGAAAAAGCT 

TGTATGCCAT 

TGGATAGCTA 

TACGTTAAAT 

AAAGCATAGA 

TTACCTGTTT 

GGCGAAATGA 

TTCTGTAACC 

AGCTATCACA 

GATACATTCA 

ATTTTCTTTT 

TCAAATAAGA 

AGATCGACCT_ 

'GTTTTAAAAA 

GCCGAAATCT 

CTCTAAATCT 

TTGACGAGGC 

TTAATTAGAG 

TTGACGAAAT 

TGGTCAAAAT 

TAAGGCAATA 

TTTTTCAATA 

GG 



AAAACCCCTA 

TATTCCCGCA 

TACTTCCTAA 

GGATGCAAAA 

AAAGTTGCAA 

ACGATTCTTT 

GAAATTGAAA 

TCTATTTTTT 

TACTAAAAGG 

ATATTAAATT 

ACCAAAGACA 

AGCAGAAGAA 

TTTTAGGTAT 

GTTTACAACT 

TATTTTGTGA 

TAGAAACAAA 

CCTTTGGGAC 

TACACAGAAT 

TGATTACTCT 

AAAGAAGACT 

GGAATTAACA 

ATCCTGATTC 

TTAAAATTCA 

ATCTTAACAA 

GAAGCGCTAG 

TCATATATCC_ 

TAGTGCTATG 

TAATCAAAAA 

TTCGTTAATA 

AACTGCACAG 

AATAAACTAA 

AAAAGCTGAA 

ATTGAGGAAG 

AATAAAATGA 

ATTTATTGAA 



TAAAAAAAGA 
CAAATGGAGA 
TACATTCAAC 
CATTGAAATC 
AAACAATTAG 
CTTGAGAAAA 
CATTACTAAT 
AGATCACTTC 
TAAACAGTTT 
CATATGAGAA 
AACTCGCTAT 
CTTATTGCAT 
CGACGGTGAA 
GAAGGAATCG 
ATTAATCTCA 
CCAAATAAAC 
AAATGCACAC 
GCTTTAGAGA 
TGCTCATCTC 
AAAGAAAGAG 
ACAAGAATAT 
TTAAAGATTT 
TATATTATAG 
TATTTTTTGA 
ACTGAGGTAA 
AATACGATGA 



AGGACTAAAT 
GAATTGCGTC 
AGTATTACCA 
AAGATCATTA 
ATTACTAATA 
CTAAAGTTAG 
GAAAAGAAGA 
GTACAGAAGT 
AAGAGGGGTT 



AAAAATTGAG 

TGGATATTGA 

AAGTATAGCA 

TGGCTCGAAA 

AGTTCTACTT 

TTAGTTTTAA 

GTTTAAATAA 

TTAAGGAGCA 

GGATTATTTC 

TGAGTCAAAG 

AAAACGCAAG 

TTACTATTCG 

CTCCCGAGTC 

CAGTTCTGAA 

AATATCTTAT 

CATGCAAGTT 

TGAATTTATA 

CTGATTTAGC 

TTATATCTTT 

ATTTCAAAAT 

TAGCAACGAA 

CAAAAATTCC 

TTTTTCATTT 

TTAGCTGGAA 

CTTAGCTTAT 

TAA GGAAACA„„ 

TTTTAGAGTC 

GATATTGCAA 

ATCTTGATTG 

AAGAAATAAA 

TAGAGATCAG 

ACAATAAAAA 

CCAGTTAGCA 

GAAGAAATAA 

TTGGGGTTTT 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

1 + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

a PQNPKTPKPL*KKKKLR*FR 

b PKTPKPQNPYKKRKN*GSLE- 

c PKPQNPKTPIKKEKIEVV*K-- 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 + + + + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

a NKILFPHKWRWILIWMI*KI 

b IKYYSRTNGDGY*FG*YRKF- 

c *NIIPAQMEMDIDL DDIENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a YFLIHSTSIAALVVTR KDAK 

b TS*YIQQV*QLL *CQERMQN- 

c LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H*NLARNRLH*LFQSCKNN* 

b lEIWLEIAFIDYSKVAKTIR - 

C LKSGSKSPSLTIPKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

a SSTSRMQIFITILS*EN*F* 

b VLLLGCKSL*RFFLEKISFK- 

C FYFSDANLYNDSFLRKLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE.*KLKHY*CLNKIR* -* 
b KRRAKSRN*NITNV*IKSGN- 

,C._ ^ S .G E. ._Q__R_ V _E_I -E--T- .L - L M - F - K - * N Q V -M- - - 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 + + + + + + 420 

ACTCGTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a *GL FYFLD HFLRSIMEKIT* 

b EDYSIF*ITS*GALWRKLLN- 

c RIILFFRSLLKEHYGENYLI- 
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TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

^ + + + + + 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 



Y*KVNSLDYFPSQQ**VY*I 
TKR*TVWIISLANNDEYIKF- 
LKGKQFGLFP* PTMMSILNS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 ^ + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

HMRMSQRISIHQTYQRQTRY 
I*E*VKGSRYIRLTKDKLAI- 
YENESKDLDTSDLPKTNSL *- 



AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

^ ^ + . + + + 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 



KTQE KV* * SNSRRTYCIYYS 
KRKKKFDNRTAEELIAFTIR 
NAR KSLI IEQQKNLLHLLFV 



TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

^ ^ + + + + 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 



YGFYYNCFRYRR*TPES *DN 
MGFI.TIVLGIDGEL PSLETI 
WVLLQLF *VS TVNSRVLRQL 



TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

^ + + + + + 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 



*KSCLQLKE SQF *KFCCVCH 
EKAVYN * .RNRS S ES SDVYA I 
KKLFTTEGIAV LKVLMCMPL 



TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

., 4. 4. + + + 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 



YFVN*SQISYLNLM DS YRNK 
IL*INLKYLISI*WIAI ETN- 
FCE LISNILSQFNG*L.*KQT- 



CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

^ + + . + ----- - ^ 

"GGTTTATTTGGTACGTTCAAATTACCTOATATGCAA^^^ 



PNKPCKFNGIYVKSF GTNAH 
Q I N H A S L M E Y T L N P L G Q M H T 
K*TMQV*WNIR*IIiWDKCTL 



TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

^ + + + h + 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 



* lYIGFLKHRYTECFRD* FS 
EFILDS*SIDTQNALETDLA 
NLYWILKA*IHRML*RLI*L 
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TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

^ + + + + + 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 



a LQQITCFDYSCSSLISLKEA 
b YNRLPVLITLAHLLYL*KKQ 
C TTDYLF*LLLLISYIFKRSR 



GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

^ + + + H + 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 



a GEMKRRLKKE ISKFVDSSVT 

b AK*KED*RKRFQNLLILL*P- 

c RNEKKTKERDFKIC*FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS*.F 

b ELTTRILATKKKKSYHNPDS - 

c N*QQEY*QRKRRRAITILIL- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKR DTFIKIHIL* 

b *RFQKFQVREIHSIiKFIYYS- 

C KDFKNSR*ERYIH*NSYIIV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCY .FLLS*QYFLISWK- 

C F HFTAVIFFYLNNIF*LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT*LIHIH 

b *KVSNKRSARLR*LSLFTFI- 

c KKYQIREALD*GNLAYSHS*- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + — — + 132_^0. 

- - - TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGtCAGTAGGCAAAATTTTT 

a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKN- 

C IDLHISN TMIRKQQSSVLKI-. 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 + + + '"h + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a *CYED*IFRVKKWSRNLNQK 

b SAMRTKFLESRNGAEILIKK- 

c VL*GLNF*SQEMEPKS*SKR- 
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GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 + + + + + + 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRR YCKRIEL*IFR**VLP 

b NCVDIAKESNSKSFVNKYYQ- 

c lASIL QKNRTLNLSLISITN- 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 + + + + + + 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a ILIDCRDCRGNCTEDH*R NK 

b S*LIEEIDEATAQKIIKEIK- 

c LD*LKRLTRQLHRRSLKK*S- 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 + + + + + + 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN*RIN *ITNIEISDLQ 

b *LLLIRE*TKLLI*RSAIFN- 

C NFY*LE NKLNY*YRDQRSSI- 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 + + + + + + 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK*KL N*S*TIKN TNLGQN - 
b *RNKS*TK VRQ*KIQTLVKI- 

c DEIKAELKLDNKKYKP WSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + + + + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

a IEEGKEDQLAKEKIRQ *IK* 

b LRKEKKTS *QKKK*GNK*NE - 

c *GRKRRPVSKRKNKAINKMS- 

GTACAGAAGTGAAGAAATAAAt^GATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQK*RNKRFIFFNNLL KRGV 

b YRS EEIKDLFFSIIY*KEGF-'- 

c T E JC _K_ *_^K_ I_ Y__F „F_ Q_^_ F_- .I. _ E. „ K- -R — G^ -F 



TTGGGGTTTTGGGGTTTTGGGG 

1741 + + -- 1762 

AACCCCAAAACCCCAAAACCCC 

a L G F W G F G 
b WGFGVLG 
c GVLGFW 
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2 EVDVQNQADNHGIHSALKTCEEIKEAKTLYSWIQKVIRCRNQSQSHYKDL 51 
|::: | :.::|: :| |.::| ::. . | |..M.| 

19 ELELEMQENQNDIQVRVK. . . . IDDPKQY . . LVNVTAACLLQEGSYYQDK 62 

52 EDIKIFAQTNIVATPRDYNEEDFKVIARKEVF.STGLMIELIDKCLVELL 10 0 

: .|.| ..|,. .|: ..:| | ...::|.: 
63 DERRYIITKALL. . . . EVAESDPEFICQLAVYIRNELYIRTTTNYIVAF . 107 

101 S S SDVSDRQKLQCFGFQLKGNQLAKTHLLTALSTQKQYFFQDEWNQVRAM 15 0 

108 CWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYI 144 

151 I GNELFRHLYTKYL I FQRTSEGTLVQFCGNNVFDHLKVNDKFDKKQKGGA 2 0 0 

I :|| , | ..|.... ::: :: 

145 FDATEFKNLY LDRILSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNE. . . PRCCSTCKYNVKNEKDHFLNNINVPNWNNMKSRTRIFYCTHF 247 

.::|| .::|..: . |... ::|. .| ..]:. |.: 
182 SEFNEYQLGKYCTES . . QRKKTMFRYLSVTNKQKWDQTKKK 220 

24 8 NRNNQFFKKHEFVSNKNNISAMDRAQTIFTNIFRFNRIRKKLKDKVIEKI 2 97 

221 . RKENLLTKLQAI KESEDKSKRETG DIMNVEDAI KALKPAVMKKI 2 64 

2 98 AYMLEKVKDFNFNYYLTKSCPLPENWRERKQKIENIilNKTREEKSKYYEE 34 7 

I . . :| :. I ..: |.| |.| :.: 

265 AKRQNAMK KHMKAPKIPNSTLESKYLTFKD 2 94 

348 LFSYTTDNKCVTQFINEFFYNILPKDFLTGRNRKNFQKKVKKYVELNKHE 3 97 

|: I -MM- : :| -I 

295 LIKFCHISEP KERVYKILGKKYPKTEEEYKAAFGDSASAPFN . PE 338 

3 98 LIHKNLLLEKINTREISWMQVETSAKHFYYFDHENIYVLWKLLRWIFEDL 44 7 

I |.. :| .|:| .: . :..|. : : .| ... :|| : : 
33 9 LAGKRMKIEISKTWENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSN, . 3 86 

44 8 WSLIRCFFYVTEQQKSYSKTYYYRKNIWDVIMKMSIADLKKETLAEVQE 4 97 

hi ::.| 

387 ILKAGVSD 394 

4 98 KEVEEWKKSLGFAPGKLRLIPKKTTFRPIMTFNKKIVNSDRKTTKLTTNT 54 7 

11- 

395 TTHS 398 

54 8 KLLNSHLMLKTLKNRMFKDPFGFAVFNYDDVMKKYEEFVCKWKQVGQPKL 5 97 

I :|- |.h--|: 

399 IVINK ICEPKAVENSKM 415 

5 98 FFATMDIEKCYDSVNREKLSTFLKTTKLLSSDFWIMTAQILKRKNNIVID 64 7 

I :. .. :.|| I :.. :|..| .: :.:|| |: |:: 

416 F. .PLQFFSAIEAVN.EAVTKGFKAKK. . . RENMNLKGQIEAVKE . .WE 4 57 



64 8 SKNFRKKEMKDYFRQKFQKIALEGGQYPTLFSVLENEQNDLNAKKTLIVE 6 97 

||:| ,|t.-: ..:-.| I- -I - 

458 KTDEEKKDM ELEQTEEGEFVKVNEGIGKQYINS lELAIK 496 

698 AKQRNYFKKDNLLQPVINICQYNYINFNGKFYKQTKGIPQGLCVSSILSS 74 7 

497 lAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGL 546 

74 8 FYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLFI 797 

: . |.||: ::: : 

54 7 MVKQRCEKSSFYIFSSPSSQCNKCYLEVDL 576 

FIG. 36 
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798 EKLINVSRENGFKFNMKK. LQTSFPLSPSKFAKYGMDSVEEQNIVQDYCD 846 
577 PGDELRPSMQKLLQEKGKLGGG. . TDFPYECIDEWTKNKTHVD 617 

84 7 WIGISIDMKTLALMPNINLRIEGILCTLNLNMQTKKASMWLKKKLKSFLM 896 

|.| II-- - -:|hl -h IM- : 

618 NIVILSDMMIAEGYSDINVRGSSIVNSI KKYKDEVN 653 

897 NNITHYFRKTITTEDFANKTLNKLFISGGYKYMQCAKEYKD.HFKKNLAM 945 

||. : |::: |::. :.|:.: :: | ::| 

654 PNIKIF. . . AVDLEGYG KCLNLGDEFNENNYIKIFGM 687 

94 6 SSMIDLEVSKIIYSVTRAFFKYLVCNIKDTIFGEEHYPDFFLSTLKHFIE 995 

|. I :|:: ... :. :::| 
688 SDSI LKFISAKQGGA NMVE 706 

996 IFSTKKYIFNRVC 1008 

707 VI . . KNFALQKIG 717 



FIG. 36 

(CONTINUED) 



132 LSTQKQYFFQDEWNQVRAMIGNEL.FRHLYTKYLIFQRTSE. . GTLVQFC 178 
:| ..| ....| MM : :. | | | . 

1 MSRRNQ KKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQI 4 3 

17 9 GNNVFDHLKVNDKFDKKQKGGAADMNEPRCCSTCKYNVKNEKDHFLNNIN 2 2 8 
:: :. ||..:. :. ..|...| :\ \ M.-:-:|--l 

4 4 KEEDLKLLKFKNQDQDGNSGNDDDDEE NNSNKQQELLRRVN 84 

22 9 VPNWNNMKSRTRI FYCTHFNRNNQFFKKHEFVSNPCNNI SAMDRAQTI FTN 2 7 8 

|:MI h-l : • -I 

8 5 QIKQQVQLIKK. , . VGSKVEKDLNLNEDENKKN 114 

2 7 9 I FRFNRIRKKLKDKVIEKI AYMLEKVKDFNFNYYLTKSCPLPENWRERKQ 3 2 8 

I- - : I l-^l-- - -11 - 

115 GLSEQQVKEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRRETDY 164 

32 9 KIENLINKTREEKSKYYEELFSYTTDNKCVTQFINE . FFYNILPKDFLTG 3 77 

.M-::: --I -I- = 1= I : Ih- 

16 5 DTEKWFEISHDQK NYVS lYANQKTSYCWWLKDYFNK 2 0 0 

3 78 RNRKNFQKKVKKYVELNKHELIHKNLLLEKINTREISWMQVETSAKHFYY 42 7 

M h- = : -M : h I-- 

201 NNYDHLNVSINRLE. , TEAEFYAFDDFSQTIKLTNNSYQTVNID 242 

4 28 ~ TdHE^Ti Y\^Wia^I^WI . . f¥dL^VVSLIRCFFYVTEQQKSYS KTYYYRKNI 4 7 5 

.: :\ IM- M ^ : Ih : I -I --I- - 

24 3 VNFDNNLCILALLRFLLSLERFNILNIRSSY. .TRNQYNFEKIGELLETI 2 90 

4 7 6 WDVIMKMS I ADLKKETLAEVQEKEVEEWKKSLGFAPGKLRL I PKKTTFRP 52 5 

:M:- . -:h • I- I-- 

2 91 FAWFSHR HLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQ 33 0 

52 6 IMTFNKKIVNSDRKTTKLTTNTKLLNSHLMLKTLKNRMFKDPFGFAVFNY 575 

: |...: I |.h - hi-. -I- • • II I- 

331 VYSFSTDLKLVD. . TNKVQDYFKFLQEFPRLTHVSQQAI PVSATNAVENL 378 

FIG. 37 
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57G DDVMKKYEEFVCKWKQVGQPKLF FATMDIEKCYDS . . VNREK 615 

: .:|| :. I |. | :. :.:| :.::| 

3 7 9 NVLLKKVKH ANLNLVS I PTQFNFDFYFVNLQHLKLEFGLEPNILTKQK 4 2 6 

516 LSTFL KTTKLLSSDFWIMTAQILKRKNNI . . VIDSKNFRKKEMK 657 

|...| |. |:| |:. M II. : ... ||::... . 

42 7 LENLLLSIKQSKNLKFLRLNFYTYVAQETSRKQILKQATTIKNLICNNKNQ 4 76 

558 DYFRQKFQKI ALEGGQYPTLFSVLEN . . EQNDLNAKKTLIVEAKQRNYFK 705 

... h III :!:••• ::!■! I 

477 EETPETKDETPSESTSGMKFFDHLSELTELEDFSVN. . . .LQATQEIY. . 520 

706 KDNLLQPVINICQYNYINFNGKFYKQTKGIPQGLCVSSILSSFYYATLEE 755 

III •! :• : ■•■I l-lh 

521 .DSLHKLLIRSTNLKKFKLSYKYEMEKSKMDTFIDLPCNI YETLNN 564 

756 SSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLFIEKLINVSR 3 05 

|: I I .||: I ||: ||. 
565 LKRCSVNISNPHGNISYELTN KDSTFYKFKLTLNQE 500 

8 0 6 ENGFKFNMKKLQTSFPLS PSKFAKYGMDSVEEQNIVQDYCDWIGI S IDMK 8 5 5 

|:.:| |..|.:. | ||.: .|:|. : ::..|..|: : ::. 
601 LQHAKYTFK. . QNEFQFNNVKSAKIESSSLESLEDIDSLCKSI ASCKNLQ 648 

856 TLALMPNINLRIEGILCTLNLNMQT. .KKASMWLKK. . KLKSFLMNNITH 901 

I ::|:. |:.... |.. :::|. .||.: .|. 
649 NVNI lASLLYPNNIQKNPFNKPNLLFFKQFEQLKNLENVSINC 691 

902 YFRKTI. . , TTEDFANKTLNKLFI SGGYKYMQCAKEYKDHFKKNLAMSSM 948 

. I I I II I .:|: :|.. ||- 

6 92 ILDQHILNSISEFLEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPEL 741 

949 IDLEVSKIIYSVT RAFFKYLVCNI KDT . . IFGEEHY 982 

:: ... : | :||:. |- ||:. .:- 

74 2 NQVYINQQLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDF 791 

983 PDFFLS TLKHFIEIFSTKKY I FNRVCMILKAKEAKLKSDQCQSLIQ 1028 

.: :| .| I :|..|| . |: ... | ..::.|.|:. 

792 DQNTVSDDSIKKILESISESKYHHYLRLNPSQSSSIilKSENEEIQELLK 84 0 

FIG. 37 

(CONTINUED) 



4 DIDLDDIENLLPNTFNKYSSSCSDKKGCKTLKSGSKSPSLTIPK 47 

::. ..||. |.. :||.|.... :|| .|. 

617 NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNP 666 

48 LQKQLEFYFSDANLYNDSFLRKLVLKSGEQRVE . . . . lETLLM 86 

: ||:| . . .|: :::| :|.| .: :| j..::: 
667 FNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEFLEKNKKIKAFIL 716 

FIG. 38 
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1 MEMDIDLDDIENL LPNTFNKYSSSCSDKKGCKTLKSGSKSPS . . . 4 2 

|: |. . .11 .,.|-. |:| |...:. . | || 

4 91 lELAIKIAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGAKKYGSVRTCLEC 54 0 

4 3 . LT I PKLQKQ LEFYFSDANL YNDS FLRKLVLKSGEQRVE I ETLL 8 5 

|.: : II : :: |... :|.::| |.::| ].-::. M 

541 ALVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL .EVDLPGDELRPSMQKLL 589 

FIG. 39 



telomerase p4 3 
human La 

Xenopus LaA I 
Drosophila La I 
S . c . Lhplp 



LQK3LEF YF SDANLYN IDS FL RKLVLKSGEQRVE lETLLM 
ICH3UEYYFGDFISILPRDKFLKEQI . KLDEGWVPLEIMIK 
CE 31 EY YFGDHNLPR 3KFL KQQI . LLDDGWVPLETMIK 
LR3VEYYFGDANLNRDKFLREQIGKNEDGWVPLSVLVT 
CLK3VEFYFSEFISIFPYDRFLRTTAEK.NDGWVPISTIAT 
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aagcagtaaa 
gaaaacaact 
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gaaagaagat 
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aaagcagtac 
atttttgatg 
cgtaaggaac 
ttcaacgaat 
cgttacctct 
aatctcttaa 
ggagacataa 
atagccaaga 
accttggaat 
cctaaagaaa 
aaagcagcct 
atgaagattg 
gaggtttggg 
t tgtctaaca 
atttgtgagc 
attgaagctg 
aatcttaaag 
aaagatatgg 
ggcaagcaat 
gatgaaatca 
a t g t c a gg t g 
cttggtttga 
agttctcaat 
tctatgtaaa 
gagtgcattg 
gatatgatga 
agcatcaaaa 
gaaggttacg 
atattcggta 
atggtcgaag 
gattcttcta 
atttaaatta 
gcctattcaa 



taaaacccaa 
aagcacaata 
accaaaatga 
tcactgcagc 
atatcatcac 
tggcagtcta 
tttgtgttgt 
ttttgcctaa 
caactgaatt 
tcactttccg 
actaacttgg 
cagttaccaa 
ccaaacttta 
tgaacgttga 
gatagaatgc 
caaagtactt 
gagtctataa 
ttggtgattc 
aaatctctaa 
ataatttaat 
tcttaaaagc 
ccaaggccgt 
ttaatgaagc 
gtcaaatcga 
agttggagta 
acattaactc 
aaggacacac 



"gag cc"a"ag a a 
tggtaaaata 
gcaataagtg 
aacttttgca 
atgaatggac 
ttgcagaagg 
agtacaagga 
gaaagtgcct 
tgagcgattc 
ttatcaaaaa 
taacaaaaat 
ttatctattt 
atgattctgc 



FIG. 42 



50/103 



4-> 

O 



m ir\ 

fN) CM 



o 

2 



! M M 

S Q W 



00 u> r-- 00 



O 



O O CM 1-t 



CD 



4-1 
-H 



O 



ro i-H 

CN a 

rH P' 

CU O 

^ u 

0) w cn 

to to iz; ^ 

to ^ i-q • 

c <u ^ u E-t • 

to o aico I 

cj rH d > tn 

O (1) O rH M 00 

U 4-> Q fO ffi 



CM 



o 



9 



XI ffi 







in 




oo 




rH rH 


in 
1 


1 






ro 


hH 

M 








tN 










rH 










Ch 


o 










in 










5? 






to 


to s 










fU M 






CN 


to 








rH 




QJ — 


u 


Eh 




(U 


B 




Pi 


m 


to 


o mco 


1 






rH C 




> 


LD 


o 


0) o 


iH 


M 


00 


u 


4-> Q 









51/103 

MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQIiA 
VYIRNELYIRTTTNYIVAFCVVHKNTQPFIEKYFNKAVLLPNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMKAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKILGKKYPKTEEEYKAAFGDSASAPFNPELAGKRMKIEISKTW 
ENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGVSDTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR 
ENMNLKGQIEAVKEWEKTDEEKKDMELEQTEEGEFVKVNEGIG 
KQYINSIELAIKIAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGA 
KKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL 
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFA 
VDLEG YGKCLNLGDE FNENNY I KI FGMSDS I LKF I S AKQGGANM 
VEVIKNFALQKIGQK 

FIG. 43 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQ 
QVQLIKKVGSKVEKDLNLNEDENKKNGLSEQQVKEEQLRTITEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFEISHDQ 
KNYVSIYANQKTSYCWWLKDYFNKNNYDHLNVSINRLETEAEFY 
AFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERF 
NILNIRSSYTRNQYNFEKIGELLETIFAWFSHRHLQGIHLQVP 
CEAFQYLVNSSSQISVKDSQLQVYSFSTDLKLVDTNKVQDYFKF 
LQEFPRLTHVSQQAIPVSATNAVENLNVLLKKVKHANLNLVSIP 
TQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEIYDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEF 
LEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPELNQVYINQ 
QLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

FIG. 45 



MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKI 
ARNEDVNNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
N YTV I Q FNGQ F FTQ I VGNRCNE PHL P P KW VQRS S S S S ATAAQ I K 
QLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLREA 
I F PTNLVKI PQRLKVR I NLTLQKLLKRHKRLN YVS I LNS I C P PL 
EGTVLDLSHLSRQSPKERVI^KFII^ILQK^LPQEMF^ 

"IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 

NFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTIVYFR 
HDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRI 
IPKKSNNEFRIIAIPCRGADEEEFTIYKENHKNAIQPTQKILEY 
LRNKRPTSFTKIYSPTQIADRIKEFKQRLLKKFNNVLPELYFMK 
FDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFNTNTGVL 
KLFNWNASRVPKPYELYIDNVRTVHLSNQDVINWEMEIFKTA 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTL I LKLADD FL 1 1 S TDQQQ V IN I KKLAMGGFQK YNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
SKGIFRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHWKNISE 
CYKSAFKDLSINVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIE 
YEVRFTILNGFLESLSSNTSKFKDNIILLRKEIQHLQAYIYIYI 
HIVN 
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Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 

FIG. 47 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGC^AAAG C^C AATAG^^ 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



FIG. 48 
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AKFLHWLMSVYWELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 
VQLRDVS E AE VRQHREARP ALLTS RLRF I PKPDGLRP I VNMD Y WGART FRRE K 
RAERLTSRVKALFSVLNYERA 

FIG. 49 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 

FIG. 50 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTWGF 
DSKPDEGVQFSSPKCSQSELIANWKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF 
PNYLISILESKNWQLLLEIIGSDAMHYLLSKGSXFEALPNDNYLQISGIPLFKNNVFEETVSKKRKR 
TIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 
LINAFQVKQLHKVIPLVSQSTWPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWL 
VLGKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 
PFITSMKMEAFEKINENNVRMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKKMLVSTN 
QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 

FR-I-VK-K-KtKDPEFVrRKYATTHATSDRATKNFVSEAFSYFDlWPF^ 

WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRWDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGIINNTFFNESKKRMPFFG 
FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN 
I YRLG YSMCMRAQAYLKRMKD I F I PQRMF I TDLLNV I GRKI WKKLAE I LGYTSRRFLS S AE VKWLFC 
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 



FIG. 51 
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Wild Type Telomerase Gene 
Wild Type Telomerase Gene 



1. Transform with linear fragment 
containing the telomerase gene 
disrupting with a LEU2 or ura4 marker 



03- 



Telomerase Gene::M 



Wild Type Telomerase Gene 




Wild Type Telomerase Gene 



_WildType_(X2). 



2. Assay in selective media 

3. Sporulate, and grow on 
selective media 




- ^Senescence?-(X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 



FIG. 65 
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heterozygote diploid 
wt diploid 
wt haploid 



Tetrad 
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600 bp 
500 bp 
400 bp 

300 bp 
200 bp 

100 bp 
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1 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAG GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TOT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40' * ^ 50 
gly ile "arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
G7^ GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

._1.6„0„. 17_0_ ^ 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 

190 200 

thr tyr cys val arg arg tyr ala val val gin lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 



FIG. 68 
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210 

gly thr ser ala arg pro ser arg ala thr ser tyr val gin cys 
GGC ACG TCC OCA AGO CCT TCA AGA GCC ACG TCC TAG GTC CAG TGC 

220 230 

gin gly ile pro gin gly ser ile leu ser thr leu leu cys ser 
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAG GGC GAG ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG 

250 260 

arg asp gly leu leu leu arg leu val asp asp phe leu leu val 
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 

280 290 

arg gly val pro glu tyr gly cys val val asn leu arg lys thr 
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 

310 320 

phe val gin met pro ala his gly leu phe pro trp cys gly leu 
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gin ser asp tyr ser ser 
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG -AGC GAC TAG TCC AGC 

340 350 

tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGG GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG 

^2 J 0 ^ —3 8 0 -~ 



arg leu lys cys his ser leu phe leu asp leu gin val asn ser 
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 

390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAG AAG ATC CTC CTG CTG CAG 

400 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAG AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 



FIG. 68 

(CONTINUED) 
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420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 
CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
-AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT— 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 

FIG. 68 

(CONTINUED) 
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Motif -1 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



. LWSLIRCFFYVTEQQKSYSKT . 
, FIIPILQSFFYITESSDLRNRT . 
, LIPKIIQTFFYCTEISSTVTIV. 
, YWELLRSFFYVTETTFQKNRL . 
FFY TE 



Motif 0 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



K 

p hhh K hR h R 

. KSLGFAPGKLRLIPKKT- -TFRPIMTFNKKIV 
. QKTTLPPAVIRLLPKKN- -TFRLITNLRKRFL 
. TLSNFNHSKMRI I PKKSNNEFRI lAI PCRGAD 
. ARPALLTSRLRFIPKPD--GLRPIVNMDYWG 
R PK R I 



Motif A 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 

Motif B 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



AF 

h hDh GY h 
. PKLFFATMDIEKCYDSVNREKLSTFLK, 
. RKKYFVRIDIKSCYDRIKQDLMFRIVK. 
. PELYFMKFDVKSCYDSIPRMECMRILK. 
. PELYFVKVDVTGAYDTIPQDRLTEVIA. 

F D YD 

hPQG pS hh 
.NGKFYKQTKGIPQGLCVSSILSSFYYA. 
. GNSQYLQKVGIPQGSILSSFLCHFYME . 
. EDKCYIREDGLFQGSSLSAPIVDLVYD , 
. RATSYVQCQGIPQGSILSTLLCSLCYG . 
G QG S 



.// 



Motif C 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



Y 

h F DD hhh 
. PNVNLLMRLTDDYLLITTQENN. . . 
. KKGSVLLRWDDFLFITVNKKD. . . 
. SQDTLILKLADDFLIISTDQQQ. . . 
. RRDGLLLRLVDDFLLVTPHLTH . . , 
DD L 



Motif D 
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Sp Tezl 
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consensus 
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1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 

~2ro'i~ 

2151 
2201 
2251 
2301 
2351 
2401 
2451 



GCAGCGCTGC 
CCGCGATGCC 
AGCCACTACC 
GCCCCAGGGC 
CGNTGGTGGC 
CCCGCCGCCC 
CCGAGTGCTG 
TCGGCTTCGC 
ACCACCAGCG 
GGGGAGCGGG 
TGGTTCACCT 
TGCGCCTACC 
TCAGGCCCGG 
CAACGGGCCT 
CCAGCCCCGG 
GTTGCCCAAG 
CCGTTGGGCA 
GACCGTGGTT 
CTCTTTGGAG 
GCCGCCAGCA 
GGGACACGCC 
TCCTCAGGCG 
AGGCCCAGCC 
TTCCAGGCCT 
GCGNTACTGG 
CGCAGTGCCC 
GCGGTCACCC 
TGTGGCGGCC 
CTGCTCCGCC 
CTGCCTGCGC 
AACGCCGCTT 
GCCAAGCTCT 
CGCTTGGCTG 
ACCGTCTGCG 
GTGTACGTCG 
GTTTCAAAAG 
TGCAAAGCAT 
CTGTCGGAAG 
GACGTCCAGA 
TGAACATGGA 
GCCGAGCGTC 
CGAGCGGGCG 
"ACGATAT'CCA 
GACCCGCCGC 
CGACACCATC 
AACCCCAGAA 
GCCCATGGGC 
AGACCTCCAG 
GCCCGCTGAG 
GCCAGCAGTG 



GTCCTGCTGC 

GCGCGCTCCC 

GCGAGGTGCT 

TGGCGGCTGG 

CCANTGCNTG 

CCTCCTTCCG 

CANANGCTGT 

GCTGCTGGAC 

TGCGCAGCTA 

GCGTGGGGGC 

GCTGGCACGC 

ANGTGTGCGG 

CCCCCGCCAC 

GGAACCATAG 

GTGCGAGGAG 

AGGCCCAGGC 

GGGGTCCTGG 

TCTGTGTGGT 

GGTGCGCTCT 

CCACGCGGGC 

TTGTCCCCCG 

ACAAGNACAC 

TGACTGGCGT 

TGGATGCCAG 

CAAATGCGGC 

CTACGGGGTG 

CAGCAGCCGG 

CCCGAGGAGG 

AGCACAGCAG 

CGGCTGGTGC 

CCTCAGGAAC 

CGCTGCAGGA 

CGCAGGAGCC 

TGAGGAGATC 

TCGAGCTGCT 

AACAGGCTCT 

TGGAATCAGA 

CAGAGGTCAG 

CTCCGCTTCA 

CTACGTCGTG 

TCACCTCGAG 

CGGCGCCCCG^ 

CAGGGCCTGG 

CTGAGCTGTA 

CCCCAGGACA 

CACGTACTGC 

ACGTCCGCAA 

CCGTACATGC 

GGATGCCGTC 

GCCTCTTCGA 



GCACGTGGGA 
CGCTGCCGAG 
GCCGCTGGCC 
TGCAGCGCGG 
GTGTGCGTGC 
CCAGGTGTCC 
GCGANCGCGG 
GGGGCCCGCG 
CCTGCCCAAC 
TGCTGCTGCG 
TGCGCGNTNT 
GCCGCCGCTG 
ACGCTANTGG 
CGTCAGGGAG 
GCGCGGGGGC 
GTGGCGCTGC 
GCCCACCCGG 
GTCACCTGCC 
CTGGCACGCG 
CCCCCATCCA 
GTGTACGCCG 
TGCGNCCCTC 
TCGGGAGGTT 
GATTCCCCGC 
CCCTGTTTCT 
TTCCTCAAGA 
TGTCTGTGCC 
AGGAACACAG 
CCCCTGGCAG 
CCCCAGGCCT 
ACCAAGAAGT 
GCTGACGTGG 
CAGGGGTTGG 
CTGGCCAAGT 
CAGGTCTTTC 
TTTTCTACCG 
CAGCACTTGA 
GCAGCATCGG 
TCCCCAAGCC 
GGAGCCAGAA 
GGTGAAGGCA 
GCCTCCTGGG 
"CGCACCTTCG 
CTTTGTCAAG 
GGCTCACGGA 
GTGCGTCGGT 
GGCCTTCAAG 
GACAGTTCGT 
GTCATCGAGC 
CGTCTTCCTA 



AGCCCTGGCC 
CCGTGCGCTC 
ACGTTCGTGC 
GGACCCGGCG 
CCTGGGANGN 
TGCCTGAANG 
CGCGAANAAC 
GGGGCCCGCG 
ACGGTGACCG 
CCGCGTGGGC 
TTGTGCTGGT 
TACCAGCTCG 
ACCCGAANGC 
GCCGGGGTCC 
AGTGCCAGCC 
CCCTGAGCCG 
GCAGGACGCC 
AGACCCGCCG 
CCACTCCCAC 
CATCGCGGCC 
AGACCAAGCA 
CTTCCTACTC 
CGTGGAGACA 
AGGTTGCCCC 
GGAGCTGCTT 
CGCAGTGCCC 
CGGGAGAAGC 
ACCCCCGTCG 
GTGTACGGCT 
CTGGGGCTCC 
TCATCTCCCT 
AAGATGAGCG 
CTGTGTTCCG 
TCCTGCACTG 
TTTTATGTCA 
GAAGAGTGTC 
AGAGGGTGCA 
GAAGCCAGGC 
TGACGGGCTG 
CGTTCCGCAG 
CTGTTCAGCG 
_CGC^CTCTGTG_ 
TGCTGCGTGT 
GTGGATGTGA 
GGTCATCGCC 
ATGCCGTGGT 
AGCCACGTCT 
GGCTCACCTG 
AGAGCTCCTC 
CGCTTCATGT 



CCGGCCACCC 
CCTGCTGCGC 
GGCGCCTGGG 
GCTTTCCGCG 
T^GGCNGCCC 
ANCTGGTGGC 
GTGCTGGCCT 
CGAGGCCTTC 
ACGCACTGCG 
GACGACGTGC 
GGNTCCCAGC 
GCGCTGCNAC 
GTCTGGGATC 
CCCTGGGCTG 
GAAGTCTGCC 
GAGCGGACGC 
TGGACCGAGT 
AAGAAGCCAC 
CCATCCGTGG 
ACCACGTCCT 
CTTCCTCTAC 
AATATATCTG 
NTCTTTCTGG 
GCCTGCCCCA 
GGGAACCACG 
GCTGCGAGCT 
CCCAGGGCTC 
CCTGGTGCAG 
TCGTGCGGGC 
AGGCACAACG 
GGGGAAGCAT 
TGCGGGACTG 
GCCGCAGAGC 
GCTGATGAGT 
CGGAGACCAC 
TGGAGCAAGT 
GCTGCGGGAG 
CCGCCCTGCT 
CGGCCGATTG 
AGAAAAGAGG 
TGCTCAACTA 
CTGGGCCTGG 



GCGGGCCCAG 
CGGGCGCGTA 
AGCATCATCA 
CCAGAAGGCC 
CTACCTTGAC 
CAGGANAACA 
CCTGAATGAG 
GCCACCACGC 
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2 5 01 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 

2 551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 

2 601 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 

2 651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 

2 701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 

2 751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 

2 801 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 

2 851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 

2 901 CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 

2 951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 

3 001 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
3 051 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
3 2 01 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
3 2 51 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3 3 01 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 
3 351 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
3401 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 
3 451 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
3 501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
3 551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
3 601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
3 651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
3 701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
3 751 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3 801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
3 851 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
3 901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 

3 951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 

4 001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC 

X + + + + + + 60 

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 

a AALRPAAHVGSPGPGHPRDA 

b QRCVLLRTWEALAPATPAMP- 

c SAASCCARGKPWPRPPPRCR- 

GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT 

61 + + + + + + 120 

CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 

a ARSPLPSRALPAAQPLPRGA 

b RAPRCRAVRSLLRSHYREVL- 

C ALPAAEPCAPCCAATTARCC- 

GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 

121 + + + + + + 180 

CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 

a AAGHVRAAPGAPGLAAGAAR 

b PLATFVRRLGPQGWRLVQRG- 

C RWPRSCGAWGPRAGGWCSAG- 

GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCCANTGCNTGGTGTGCGTGCCCTGGGANGN 

181 + + + + + + 240 

CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCACACGCACGGGACCCTNCN 

a GPGGFPR?GGP??GVRALG? 

b DPAAFRA?VA?C?VCVPW??- 

C TRRLSARWWPPAWCACPG??- 

ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

241 + + + + + + 300 

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACAGGACGGACTTNCTNGACCACCG 

a ?AAPRRPLLPPGVLPE??GG 

b ??PPAAPSFRQVSCL??LVA- 

C G?PPPPPPSARCPA*??WWP- 

CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC 

301 + + + + + + 360 

GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTTNTTGCACGACCGGAAGCCGAAGCG 

a PSAA?AVR?RRE?RAGLRLR 

b RVL??LC?RGA?NVLAFGFA- 

C ECC??CA?AAR?TCWPSASR- 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 

361 + + + + + + 420 

_ eGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT — 

a AAGRGPRGPPRGLHHQRAQL 

b LLDGARGGPPEAFTTSVRSY- 

C CWTGPAGAPPRPSPPACAAT- 

CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG 

421 + + + + + + 480 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 

a PAQHGDRRTAGERGVGAAAA 

b LPNTVTDALRGSGAWGLLLR- 

c CPTR*PTHCGGAGRGGCCCA- 
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a PRGRRRAGSPAGTLR??CAG 

b RVGDDVLVHLLARCA?FVLV- 

c AWATTCWFTCWHAAR?LCWW- 

GGNTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC 
541 + + + + + + 600 

CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG 

a GSQLRLP7VRAAAVPARRC? 

b ?PSCAY?VCGPPLYQLGAAT- 

C ?PAAPT?CAGRRCTSSAL?L- 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCAACGGGCCT 

601 + + + + + + 660 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

a SGPAPATR?WTR?RLGSNGP 

b QARPPPHA?GPE?VWDPTGL- 

C RPGPRHTL?DP?ASGIQRAW- 

GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 

661 + + -I- + + + 720 

CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

a GTIASGRPGSPWAASPGCEE 

b EP*RQGGRGPPGLPAPGARR- 

c NHSVREAGVPLGCQPRVRGG- 

GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 

721 + + + + + + 780 

CGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

a ARGQCQPKSAVAQEAQAWRC 

b RGGSASRSLPLPKRPRRGAA- 

c AGAVPAEVCRCPRGPGVALP- 

CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 

781 + + -I- + + + 840 

GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

a P*A GADARWAGVLGPPGQDA 

b PEPERTPVGQGSWAHPGRTP- 

C LSRSGRPLGRGPGPTRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 

841 + + + + + + 900 

ACCTGGCTCACTGGCACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTCGGTG 

a WTE*PWFLCGVTCQTRRRSH 

b GPSDRGFCVVSPARPAEEAT- 

C DRVTVVSVWCHLPDPPKKPP- 



CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 

901 + + + + + + 960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT 

a LFGGCAIiWHAPLPPIRGPPA 

b SLEGALSG TRHSHPSVGRQH- 

c LWRVRSLARATPTHPWAAST- 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCCCCG 

961 + 4- + + + + 1020 

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 
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a PRGPPIHIAATTSWDTPCPP 

b HAGPPSTSRPPRPGTRLVPR- 

C TRAPHPHRGHHVLGHALSPG- 

GTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGNACACTGCGNCCCTC 

1021 + + + + + + 1080 

CACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAGTCCGCTGTTCNTGTGACGCNGGG^ 

a VYAETKHFLYSSGDK?TA?L 

b CTPRPSTSSTPQAT?TLRPS- 

C VRRDQALPLLLRRQ?HC?PP- 

CTTCCTACTCAATATATCTGAGGCCCAGCCTGACTGGCGTTCGGGAGGTTCGTGGAGACA 

1081 + + + + + + 1140 

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT 

a LPTQYI*GPA*LAFGRFVET 

b FLLNISEAQPDWRSGGSWR?- 

C SYSIYLRPSLTGVREVRGD?- 

NTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCCCCGCAGGTTGCCCCGCCTGCCCCA 

1141 + + + + + + 1200 

NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAGGGGCGTCCAACGGGGCGGACGGGGT 

a PFLVPGLGCQDSPQVAPPAP 

b SFWFQALDARIPRRLPRLPQ- 

c LSGSRPWMPGFPAGCPACPS- 

GCGOTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCC 

1201 + + + + + + 1260 

CGCNATGACCGTTTACGCCGGGGACAAAGACCTCGACGAACCCTTGGTGCGCGTCACGGG 

a A7LANAAPVSGAAWEPRAVP 

b RYWQMRPLFLELLGNHAQCP- 

C 7TGKCGPCFWSCLGTTRSAP- 

CTACGGGGTGTTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGG 

1261 + + + + + + 1320 

GATGCCCCACAAGGAGTTCTGCGTGA03GGCGACGCTCGACGCCAGTGGGGTCGTCX3GCC 

a LRGVPQDALPAASCGHPSSR 

b YGVFLKTHCPLRAAVTPAAG- 

C TGCSSRRTARCELRSPQQPV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGAACACAG 

1321 + + + + + + 1380 

ACAGACACGGGCCCTCTTCGGGGTCCCGAGACACCGCCGGGGGCTCCTCCTCCTTGTGTC 

a CLCPGEAPGLCGGPRGGGTQ 

b VCAREKPQGSVAAPEEEEHR- 

'c S_ y_ PGR S P R A___L_ W R P ^ R R R N T D - 



ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT 

1381 + + + + + + 1440 

TGGGGGCAGCGGACCACGTCGACGAGGCGGTCGTGTCGTCGGGGACCGTCCACATGCCGA 

a TPVAWCSCSASTAAPGRCTA 

b PPSPGAAAPPAQQPLAGVRL- 

C PRRLVQLLRQHSSPWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG 

1441 + + + + + + 1500 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
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a SCGPACAGWCPQASGAPGTT 

b RAGLPAPAGAPRPLGLQAQR- 

C VRACLRRLVPPGLWGSRHNE- 

AACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCT 

1501 + + + + + + 1560 

TTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTAGAGGGACCCCTTCGTACGGTTCGAGA 

a NAASSGTPRSSSPWGSMPSS 

b TPLPQEHQEVHLPGEACQAL- 

C RRFLRNTKKFISLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCC 

1561 + + + + + + 1620 

GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 

a RCRS*RGR*ACGTALGCAGA 

b AAGADVEDERAGLRLAAQEP- 

C LQELTWKMSVRDCAWLRRSP- 

CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGT 

1621 + + + + + + 1680 

GTCCCCAACCXSACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA 

a QGLAVFRPQSTVCVRRSWPS 

b RGWLCSG-RRAPSA*GDPGQV- 

C GVGCVPAAEHRLREEILAKF- 

TCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCA 

1681 + + + + + + 1740 

AGGACGTGACCGACTACTCACACATGCAGCAGCTCGACGAGTCCAGAAAGAAAATACAGT 

a SCTG**VCTSSSCSGLSFMS 

b PALADECVRRRAAQVFLLCH- 

C LHWLMSVYVVELLRSFFYVT- 

CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGT 

1741 + + + + + + 1800 

GCCTCTGGTGCAAAGTTTTCTTGTCCGAGAAAAAGATGGCCTTCTCACAGACCTCGTTCA 

a RRPRFKRTGSFSTGRVSGAS 

b GDHVSKEQALFLPEECLEQV- 

C ETTFQKNRLFFYRKSVWSKL- 

TGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAG 

1801 + + + + + + 1860 

ACGTTTCGTAACCTTAGTCTGTCGTGAACTTCTCCCACGTCGACGCCCTCGACAGCCTTC 

a CKALESDST*RGCSCGSCRK 

b AKHWNQTALEEGAAAGAVGS- 

c __S^ G 1_ R Q _ H L K R V Q L _R EL SE A ^ 



CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA 

1861 -f + + + + + 1920 

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

a QRSGSIGKPGPPC*RPDSAS 

b RGQAASGSQARPADVQTPLH- 

C EVRQHREARPALLTSRLRFI- 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAA 

1921 + + + + + + 1980 

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT 
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a SPSLTGCGRL*.TWTTSWEPE 

b PQA*RAAADCEHGLRRGSQN- 

C PKPDGLRPIVNMDYVVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCXSTCTCACCTCGAGGGTGAAGGCACTGTTCAGCG 

1981 + + + + + + 2040 

GCAAGGCGTCTCTTTTCTCCCGGCTCGCAGAGTGGAGCTCCCACTTCCGTGACAAGTCGC 

a RSAEKRGPSVSPRG*RHCSA 

b VPQRKEGRASHLEGEGTVQR- 

c FRREKRAERLTSRVKALFSV- 

TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGG 

2041 + + + -H + + 2100 

ACGAGTTGATGCTCGCCCGCGCCGCGGGGCCGGAGGACCCGCGGAGACACGACCCGGACC 

a CSTTSGRGAPASWAPLCWAW 

b AQLRAGAAPRPPGRLCAGPG- 

C LNYERARRPGLLGASVLGLD- 

ACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

2101 + + + -t- + + 2160 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 

a TISTGPGAPSCCVCGPRTRR 

b RYPQGLAHLRAACAGPGPAA- 

C DIHRAWRTFVLRVRAQDPPP- 

CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACA 

2161 + + + + + + 2220' 

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT 

a LSCTLSRWM*RARTTPSPRT 

b *AVLCQGGCDGRVRHHPPGQ- 

C ELYFVKVDVTGAYDTIPQDR- 

GGCTCACGGAGGTCATCGCCAGCATCATCA^CCCCAGAACACGTACTGCGTGCGTCGGT 

2221 + + + + + + 2280 

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTTTGGGGTCTTGTGCATGACGCACGCAGCCA 

a GSR RSSPASSNPRTRTACVG 

b AHGGHRQHHQTPEHVLRASV- 

C LTEVIASIIKPQNTYCVRRY- 

ATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCT 

2281 + + + + + + 2340 

TACGGCACCAGGTCTTCCGGCGGGTACCCGTGCAGGCGTTCCGGAAGTTCTCGGTGCAGA 

a MPWSRRPPMGTSARPSRATS 

b CRGPEGRPWARPQGLQEPRL- 

c A_ V y _Q_ „K_ _A_.A_H_„G_H_ _V _R„_K_ _A_ .F_K. _S„ H _ V__S--- 



CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGANAACA 

2341 + + + + + + 2400 

GATGGAACTGTCTGGAGGTCGGCATGTACGCTGTCAAGCACCGAGTGGACGTCCTNTTGT 

a LP*QTSSRTCDSSWLTCR?T 

b YLDRPPAVHATVRGSPAG?Q- 

C TLTDLQPYMRQFVAHLQ7NS- 

GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTG 

2401 + + + + + + 2460 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
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a AR*GMPSSSSRAPP*MRPAV 

b PAEGCRRHRAELLPE*GQQW- 

C PLRDAVVIEQSSSLNEASSG- 

GCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGT 

2461 + + + + + + 2520 

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTCA 

a ASSTSSYASCATTPCASGAS 

b PLRRLPTLHVPPRRAHQGQV- 

C LFDVFLRFMCHHAVRIRGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCC 

2521 + + + -I- + + 2580 

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACX5AGACGTCGG 

a PTSSARGSRRAPSSPRCSAA 

b LRPVPGDPAGLHPLHAALQP- 

c YVQCQGIPQGSILSTLLCSL- 

TGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCC 

2581 + + + + + + 2640 

ACACGATGCCGCTGTACCTCTTGTTCGACAAACGCCCCTAAGCCGCCCTGCCCGACGAGG 

a CATATWRTSCLRGFGGTGCS 

b VLRRHGEQAVCGDSAGRAAP- 

C CYGDMENKLFAGIRRDGLLL- 

TGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCC 

2641 + + + + + + 2700 

ACGCAAACCACCTACTAAAGAACAACCACTGTGGAGTGGAGTGGGTGCGCTTTTGGAAGG 

a CVWWMISCW + HLTSPTRKPS 

b AFGG*FLVGDTSPHPRENLP- 

C RLVDDFLLVTPHLTHAKTFL- 

TCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAG 

2701 + + + + + + 2760 

AGTCCTGGGACCAGGCTCCACAGGGACTCATACCGACGCACCACTTGAACGCCTTCTGTC 

a SGPWSEVSLSMAAW*TCGRQ 

b QDPGPRCP*VWLRGELAEDS- 

C RTLVRGVPEYGCVVNLRKTV- 

TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGG 

2761 + + + + 4- + 2820 

ACCACTTGAAGGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAAGTCTACGGCC 

a W*TSL*KTRPWVARLLFRCR 

b GELPCRRRGPGWHGFCSDAG- 

c __V N F_P B_D E___A L_G^G__T A_ F__V Q M P^A^^„ 



CCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGA 

2821 + + + + + + 2880 

GGGTGCCGGATAAGGGGACCACGCCGGACGACGACCTATGGGCCTGGGACCTCCACGTCT 

a PTAYSPGAACCWIPGPWRCR 

b PRPIPLVR.PAAGYPDPGGAE- 

C HGLFPWCGLLLDTRTLEVQS- 

GCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCT 

2881 + + + + + + 2940 

CGCTGATGAGGTCGATACGGGCCTGGAGGTAGTCTCGGTCAGAGTGGAAGTTGGCGCCGA 
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a ATTPAMPGPPSEPVSPSTAA 

b RLLQLCPDLHQSQSHLQPRL- 

C DYSSYARTSIRASLTFNRGF- 

TCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACA 

2941 + + + + + + 3000 

AGTTCCGACCCTCCTTGTACGCAGCGTTTGAGAAACCCCAGAACGCCGACTTCACAGTGT 

a SRLGGTCVANSLGSCG*SVT 

b QGWEEHASQTLWGLAAEVSQ- 

C KAGRNMRRKLFGVLRLKCHS- 

GCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGA 

3001 + + + + + + 3060 

CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGCCACACGTGGTTGTAGATGTTCT 

a ACFWICR*TASRRCAPTSTR 

b PVSGFAGEQPPDGVHQHLQD- 

c LFLDLQVNSLQTVCTNIYKI- 

TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGC 

3061 + + + + + + 3120 

AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 

a SSCCRRTGFTHVCCSSHFIS 

b PPAAGVQVSRMCAAAPISSA- 

C LLLQAYRFHACVLQLPFHQQ- 

AAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCT 

3121 + + + + + + 3180 

TTCAAACCTTCTTGGGGTGTAAAAAGGACGCGCAGTAGAGACTGTGCCGGAGGGAGACGA 

a KFGRTPHFSCASSLTRPPSA 

b SLEEPHIFPARHL*HGLPLL- 

C VWKNPTFFLRVISDTASIiCY- 

ACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCC 

3181 + + + -f + + 3240 

TGAGGTAGGACTTTCGGTTCTTGCGTCCCTACAGCGACCCCCGGTTCCCGCGGCGGCCGG 

a TPS*KPRTQGCRWGPRAPPA 

b LHPESQERRDVAGGQGRRRP- 

C SILKAKNAGMSLGAKGAAGP- 

CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTC 

3241 + + + + + + 3300 

GAGACGGGAGGCTCCGGCACGTCACCGACACGGTGGTTCGTAAGGACGAGTTCGACTGAG 

a LCPPRPCSGCATKHSCSS*Li 

b SALRGRAVAVPPSIPAQADS- 

C S E A V Q_ W L C H. _Q__A_F__ L L K L R_= 



GACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGA 

3301 + + + + + + 3360 

CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 

a DTVSPTCHSWGHSGQPRRS* 

b TPCHLRATPGVTQDSPDAAE- 

C HRVTYVPLLGSLRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC 

3361 + + + + + + 3420 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG 
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a VGSSRGRR*LPWRPQPTRHC 

b SEAPGDDADCPGGRSQPGTA- 

C RKIiPGTTLTALEAAANPALP- 

CCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGC^ 

3421 + + + + + + 3480 

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

a PQTSRPSWTDGHPPTARPRA 

b LRLQDHPGLMATRPQPGREQ- 

C SDFKTILD*WPPAHSQA'ESR- 

GACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACAC 

3481 + + + + + + 3540 

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG 

a DTSSPVTPGSTSQGGRGGPH 

b TPAALSRRALRPREGGAAHT- 

c HQQPCHAGLYVPGREGRPTP- 

CCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGT 

3541 + + + -f + + 3600 

GGTCCGGGCGTGGCGACCCTCAGACTCCGGACTCACTCACAAACCGGCTCCGGACGTACA 

a PGPHRWESEA*VSVWPRPAC 

b QARTAGSLRPE*VFGRGLHV- 

C RPAPLGV*GLSECLAEACMS- 

CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTG 

3601 -f + + + + + 3660 

GGCCGACTTCCGACTCACAGGCCGACTCCGGACTCGCTCACAGGTCGGTTCCCGACTCAC 

a PAEG*VSG*GLSECPAKG*V 

b RLKAECPAEA*ASVQPRAEC- 

C G. *RLSVRLRPERVSSQGLSV- 

TCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCC 

3661 + + + + + + 3720 

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

a SSTPAVFTSPQAGARLHPRA 

b PAHLPSSLPHRLALGSTPGP*- 

C QHTCRLHFPTGWRSAPPQGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGA 

3721 + + + + + + 3780 

TCGAAAAGGAGTGGTCCTCGGGCCGAAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT 

a SFS SPGARLPLPT*E*SIPR 

b AFPHQEPGFHSPHRNSPSPD- 

C L__Z L T R S P _A S ^T. _P_ _H„ I__ G _V__H._ P Q I_ _- 



TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTG 

3781 + + + + + + 3840 

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC 

a FAIVHPSPCPPLPSTPTIQV 

b SPLFTPRPALLCLPPPPSRW- 

C RHCSPLALPSFAFHPHHPGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTG 

3841 + + + + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACCTCACTGGTTTCCACACGGGAC 
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a ETLRRTLGALGIWSDQRCAL 

b RP*EGPWELWEFGVTKGVPC- 

C DPEKDPGSSGNLE*PKVCPV- 

TACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGT 

3901 + + + + + + 3960 

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA 

a YTGEDPAPGWGSLWVKLGGG 

b TQARTLHLDGGPCGSNWGEV- 

C HRRGPCTWMGVPVGQIGGRC- 

GCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAA 
3961 + + + + + -f 4020 

a AVGVKY*IYEFFSFEKKKKK 

b LWE*NTEYMSFSVLKKKKKK- 

C CGSKILNI*VFQF*KKKKKK- 

AAAAAAAAA 

4021 4029 

TT T TTTT 

a K K K - 

b K K - 

c K K - 
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1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC OCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

^ _^ 16 0 ^ 



leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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leu gly cys 
CTG GGA TGC 



val pro leu 
GTC CCC CTG 



ser ala ser 
AGT GCC AGC 



ala ala pro 
GCT GCC CCT 



ala his pro 
GCC CAC CCG 



val val ser 
GTG GTG TCA 

gly ala leu 
GGT GCG CTC 



gin his his 
CAG CAC CAC 



trp asp thr 
TGG GAC ACG 



leu tyr ser 
CTC TAC TCC 



leu ser ser 
CTC AGC TCT 



glu thr ile 
GAG ACC ATC 



arg arg leu 
CGC AGG TTG 



leu phe leu 
CTG TTT CTG 



val leu leu 
GTG CTC CTC 



00 

glu arg ala 
GAA CGG GCC 



gly leu pro 
GGC CTG CCA 

230 

arg ser leu 
CGA AGT CTG 



glu pro glu 
GAG CCG GAG 

260 

gly arg thr 
GGC AGG ACG 



pro ala arg 
CCT GCC AGA 
290 

ser gly thr 
TCT GGC ACG 



ala gly pro 
GCG GGC CCC 

320 

pro cys pro 
CCT TGT CCC 



ser gly asp 
TCA GGC GAC 

350 

leu arg pro 
CTG AGG CCC 



phe leu gly 
TTT CTG GGT 

380 

pro arg leu 
CCC CGC CTG 



glu leu leu 
GAG CTG CTT 

410 

lys thr his 
AAG ACG CAC 



trp asn his 
TGG AAC CAT 

220 

ala pro gly 
GCC CCG GGT 



pro leu pro 
CCG TTG CCC 

250 

arg thr pro 
CGG ACG CCC 



arg gly pro 
CGT GGA CCG 

280 

pro ala glu 
CCC GCC GAA 

arg his ser 
CGC CAC TCC 

310 

pro ser thr 
CCA TCC ACA 



pro val tyr 
CCG GTG TAC 

340 

lys glu gin 
AAG GAG CAG 



ser leu thr 
AGC CTG ACT 

370 

ser arg pro 
TCC AGG CCC 



pro gin arg 
CCC CAG CGC 

400 

gly asn his 
GGG AAC CAC 



cys pro leu 
TGC CCG CTG 



ser val arg 
AGC GTC AGG 



ala arg arg 
GCG AGG AGG 



lys arg pro 
AAG AGG CCC 



val gly gin 
GTT GGG CAG 



ser asp arg 
AGT GAC CGT 



glu ala thr 
GAA GCC ACC 

his pro ser 
CAC CCA TCC 



ser arg pro 
TCG CGG CCA 



ala glu thr 
GCC GAG ACC 



leu arg pro 
CTG CGG CCC 



gly ala arg 
GGC GCT CGG 



trp met pro 
TGG ATG CCA 



tyr trp gin 
TAC TGG CAA 



ala gin cys 
GCG CAG TGC 



arg ala ala 
CGA GCT GCG 



210 

glu ala gly 
GAG GCC GGG 



arg gly gly 
CGC GGG GGC 

240 
arg arg gly 
AGG CGT GGC 



gly ser trp 
GGG TCC TGG 

270 
gly phe cys 
GGT TTC TGT 



ser leu glu 
TCT TTG GAG 
300 

val gly arg 
GTG GGC CGC 



pro arg pro 
CCA CGT CCC 

330 
lys his phe 
AAG CAC TTC 



ser phe leu 
TCC TTC CTA 

360 
arg leu val 
AGG CTC GTG 



gly thr pro 
GGG ACT CCC 



390 

met arg pro 
ATG CGG CCC 



pro tyr gly 
CCC TAC GGG 

420 
val thr pro 
GTC ACC CCA 
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430 



ala 
GCA 


ala 
GCC 


gly 

GGT 


val 
GTC 


cys 
TGT 


ala 
GCC 


arg 
CGG 


glu 
GAG 


lys 
AAG 


pro 
CCC 


gin 
CAG 


gly 
GGC 


ser 
TCT 


val 
GTG 


ala 
GCG 


ala 
GCC 


pro 

ccc 


glu 
GAG 


440 
glu 
GAG 


glu 
GAG 


asp 
GAC 


thr 
ACA 


asp 
GAC 


pro 
CCC 


arg 
CGT 


arg 
CGC 


leu 
CTG 


val 
GTG 


450 

gin 

CAG 


leu 
CTG 


leu 
CTC 


arg 
CGC 


gin 
CAG 


his 
CAC 


ser 
AGC 


ser 
AGC 


pro 
CCC 


trp 
TGG 


460 
gin 
CAG 


val 
GTG 


tyr 
TAC 


gly 
GGC 


phe 
TTC 


val 
GTG 


arg 
CGG 


ala 
GCC 


cys 
TGC 


leu 
CTG 


470 
arg 
CGC 


arg 
CGG 


leu 
CTG 


val 
GTG 


pro 
CCC 


pro 
CCA 


gly 
GGC 


leu 
CTC 


trp 
TGG 


gly 
GGC 


480 
ser 
TCC 


arg 
AGG 


his 
CAC 


asn 
AAC 


glu 
GAA 


arg 
CGC 


arg 
CGC 


phe 
TTC 


leu 
CTC 


arg 
AGG 


490 
asn 
AAC 


thr 
ACC 


lys 
AAG 


lys 
AAG 


phe 
TTC 


ile 
ATC 


ser 
TCC 


leu 
CTG 


gly 
GGG 


lys 
AAG 


500 
his 
CAT 


ala 
GCC 


lys 
AAG 


leu 
CTC 


ser 
TCG 


leu 
CTG 


gin 

CAG 


glu 
GAG 


leu 
CTG 


thr 
ACG 


510 
trp 
TGG 


lys 
AAG 


met 
ATG 


ser 
AGC 


val 
GTG 


arg 
CGG 


asp 
GAC 


cys 
TGC 


ala 
GCT 


trp 
TGG 


520 
leu 
CTG 


arg 
CGC 


arg 
AGG 


ser 
AGC 


pro 
CCA 


gly 
GGG 


val 
GTT 


gly 
GGC 


cys 
TGT 


val 
GTT 


530 
pro 
CCG 


ala 
GCC 


ala 
GCA 


glu 
GAG 


his 
CAC 


arg 
CGT 


leu 
CTG 


arg 
CGT 


glu 
GAG 


glu 
GAG 


540 
ile 
ATC 


leu 
CTG 


ala 
GCC 


lys 
AAG 


phe 
TTC 


leu 
CTG 


his 
CAC 


trp 
TGG 


leu 
CTG 


met 
ATG 


550 
ser 
AGT 


val 
GTG 


tyr 
TAC 


val 
GTC 


val 
GTC 


glu 

GAG 


leu 
CTG 


leu 
CTC 


arg 
AGG 


ser 
TCT 


c n 
D b U 

phe 
TTC 


phe 
TTT 


tyr 
TAT 


val 
GTC 


thr 
ACG 


glu 

GAG 


thr 
ACC 


thr 
ACG 


phe 
TTT 


gin 
CAA 


o / U 
lys 
AAG 


asn 
AAC 


arg 
AGG 


leu 
CTC 


phe 
TTT 


phe 
TTC 


tyr 
TAC 


arg 
CGG 


pro 
CCG 


ser 
AGT 


580 
val 
GTC 


trp 
TGG 


ser 
AGC 


lys 
AAG 


leu 
TTG 


gin 
CAA 


ser 
AGC 


ile 
ATT 


gly 
GGA 


ile 
ATC 


590 
arg 
AGA 


gin 
CAG 


his 
CAC 


leu 
TTG 


lys 
AAG 


arg 

.AGG. 


val 
GTG 


gin 
CAG 


leu 
CTG 


arg 
CGG 


600 
glu 
GAG 


leu 
CTG 


ser 
TCG 


glu 
GAA 


ala 
GCA 


glu 
GAG 


val 
GTC 


arg 
AGG 


gin 

CAG 


his 
CAT 


o ± U 
arg 
CGG 


glu 

GAA 


ala 
GCC 


arg 
AGG 


pro 
CCC 


ala 
GCC 


leu 
CTG 


leu 
CTG 


thr 
ACG 


ser 
TCC 


620 
arg 
AGA 


leu 
CTC 


arg 
CGC 


phe 
TTC 


ile 
ATC 


pro 
CCC 


lys 
AAG 


pro 
CCT 


asp 
GAC 


gly 
GGG 


630 
leu 
CTG 


arg 

CGG 


pro 
CCG 


ile 
ATT 


val 
GTG 


asn 
AAC 


met 
ATG 


asp 
GAC 


tyr 
TAC 


val 

GTC 


640 
val 
GTG 


gly 
GGA 


ala 
GCC 


arg 
AGA 


thr 
ACG 


phe 
TTC 


arg 
CGC 
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650 660 
^3rg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 

770 780 
thr asp leu gin pro tyr met arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

800 810 
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 



830 840 
gin cys gin gly ile pro gin gly ser ile leu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 
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880 



leu 
TTG 


val 
GTG 


thr 
ACA 


pro 
CCT 


his 
CAC 


leu 
CTC 


thr 
ACC 


his 
CAC 


ala 
GCG 


lys 
AAA 


thr 
ACC 


phe 
TTC 


leu 
CTC 


arg 
AGG 


thr 
ACC 


leu 
CTG 


val 
GTC 


arg 
CGA 


890 
gly 
GGT 


val 
GTC 


pro 
CCT 


glu 
GAG 


tyr 
TAT 


gly 
GGC 


cys 
TGC 


val 
GTG 


val 
GTG 


asn 
AAC 


900 
leu 
TTG 


arg 
CGG 


lys 
AAG 


thr 
ACA 


val 
GTG 


val 
GTG 


asn 
AAC 


phe 
TTC 


pro 
CCT 


val 
GTA 


910 
glu 
GAA 


asp 
GAC 


glu 
GAG 


ala 
GCC 


leu 
CTG 


gly 
GGT 


gly 
GGC 


thr 
ACG 

gly 
GGC 


ala 
GCT 

leu 
CTG 


phe 
TTT 

leu 
CTG 


920 
val 
GTT 

leu 
CTG 


gin 
CAG 

asp 
GAT 


met 
ATG 

thr 
ACC 


pro 
CCG 

arg 
CGG 


ala 
GCC 

thr 
ACC 


his 
CAC 
940 
leu 
CTG 


gly 
GGC 

glu 
GAG 


leu 
CTA 

val 
GTG 


phe 
TTC 

gin 
CAG 


pro 
CCC 

ser 
AGC 


930 
trp 
TGG 

asp 
GAC 


cys 
TGC 

tyr 
TAC 


ser 
TCC 


ser 
AGC 


tyr 
TAT 


950 
ala 
GCC 


arg 
CGG 


thr 
ACC 


ser 
TCC 


ile 
ATC 


arg 
AGA 


ala 
GCC 


ser 
AGT 


val 
GTC 


thr 
ACC 


960 
phe 
TTC 


asn 
AAC 


arg 
CGC 


gly 
GGC 


phe 
TTC 


lys 
AAG 


ala 
GCT 


gly 
GGG 


arg 
AGG 


asn 
AAC 


970 
met 
ATG 


arg 
CGT 


arg 
CGC 


lys 
AAA 


leu 
CTC 


phe 
TTT 


gly 
GGG 


val 
GTC 


leu 
TTG 


arg 
CGG 


980 
leu 
CTG 


lys 
AAG 


cys 
TGT 


his 
CAC 


ser 
AGC 


leu 
CTG 


phe 
TTT 


leu 
CTG 


asp 
GAT 


leu 
TTG 


990 
gin 
CAG 


val 
GTG 


asn 
AAC 


ser 
AGC 


leu 
CTC 


gin 
CAG 


thr 
ACG 


val 
GTG 


cys 
TGC 


thr 
ACC 


1000 
asn ile 
AAC ATC 


tyr 
TAC 


lys 
AAG 


ile 
ATC 


leu 
CTC 


leu 
CTG 


leu 
CTG 


gin 
CAG 


ala 
GCG 


1010 
tyr arg 
TAC AGG 


phe 
TTT 


his 
CAC 


ala 
GCA 


cys 
TGT 


val 
GTG 


leu 
CTG 


gin 
CAG 


leu 
CTC 


1020 
pro phe 
CCA TTT 


his 
CAT 


gin 
CAG 


gin 
CAA 


val 
GTT 


trp 
TGG 


lys 
AAG 


asn 
AAC 


pro 
CCC 


1030 
thr phe 
ACA TTT 


phe 
TTC 


leu 
CTG 


arg 
CGC 


val 
GTC 


ile 
ATC 


ser 
TCT 


asp 
GAC 


thr 
ACG 


1040 
ala ser 
GCC TCC 


leu 
CTC 


cys 
TGC 


tyr 
TAC 


ser 
TCC 


ile 
ATC 


leu 
CTG 


lys 
AAA 


ala 
GCC 


1050 
lys asn 
AAG AAC 


ala 
GCA 


gly 
GGG 


met 
ATG 


ser 
TCG 


leu 
CTG 


gly 
GGG 


ala 
GCC 


lys 
AAG 


1060 
gly ala 
GGC GCC 


ala 
GCC 


gly 
GGC 


pro 
CCT 


leu 
CTG 


pro 
CCC 


ser 
TCC 


glu 
GAG 


ala 
GCC 


1070 
val gin 
GTG CAG 


trp 
TGG 


leu 
CTG 


cys 
TGC 


his 
CAC 


gin 
CAA 


ala 
GCA 


phe 
TTC 


leu 
CTG 


1080 
leu lys 
CTC AAG 


leu 
CTG 


thr 
ACT 


arg 
CGA 


his 
CAC 


arg 
CGT 


val 
GTC 


thr 
ACC 


tyr 
TAC 


1090 
val pro 
GTG CCA 


leu 
CTC 


leu 
CTG 


gly 
GGG 


ser 
TCA 


leu 
CTC 
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1100 1110 
arg thr ala gin thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAAAA 
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